Application checkpointing

Results: 180



#Item
131Fault-tolerant computer systems / LAM/MPI / Computer cluster / Open MPI / Live migration / Blue Gene / Application checkpointing / Job scheduler / Message Passing Interface / Computing / Concurrent computing / Parallel computing

Proactive Process-Level Live Migration and Back Migration in HPC Environments Chao Wanga , Frank Muellera , Christian Engelmannb , Stephen L. Scottb b Oak a Dept. of Computer Science, North Carolina State University, Ra

Add to Reading List

Source URL: www.christian-engelmann.info

Language: English - Date: 2012-02-23 11:33:04
132Fault-tolerant computer systems / Concurrent computing / Application programming interfaces / Application checkpointing / Algorithms for Recovery and Isolation Exploiting Semantics / Computer cluster / LAM/MPI / Thread / Garbage collection / Computing / Parallel computing / Computer programming

Hybrid Checkpointing for MPI Jobs in HPC Environments 1 Chao Wang1 , Frank Mueller1 , Christian Engelmann2 , Stephen L. Scott2 Department of Computer Science, North Carolina State University, Raleigh, NC ([removed]

Add to Reading List

Source URL: www.christian-engelmann.info

Language: English - Date: 2011-05-03 17:32:36
133Solid-state drive / Application checkpointing / Blue Gene / IOPS / Computer cluster / Multi-core processor / Single system image / Computing / Parallel computing / Fault-tolerant computer systems

Functional Partitioning to Optimize End-to-End Performance on Many-core Architectures Min Li1 , Sudharshan S. Vazhkudai2, Ali R. Butt1 , Fei Meng3, Xiaosong Ma2,3 , Youngjae Kim2 , Christian Engelmann2, and Galen Shipman

Add to Reading List

Source URL: www.christian-engelmann.info

Language: English - Date: 2011-05-03 17:32:34
134Computer programming / Charm++ / Multi-core processor / Blue Gene / Application checkpointing / OpenMP / Thread / Supercomputer / Cell / Computing / Concurrent computing / Parallel computing

Charm++ Migratable Objects + Active Messages + Adaptive Runtime = Productivity + Performance Laxmikant V. Kale‡ Anshu Arya, Nikhil Jain, Akhil Langer, Jonathan Lifflander, Harshitha Menon

Add to Reading List

Source URL: charm.cs.illinois.edu

Language: English - Date: 2012-11-20 12:54:11
135Application programming interfaces / Message Passing Interface / Fault-tolerant computer systems / Concurrent computing / Application checkpointing / LAM/MPI / Thread / Kernel / Computing / Parallel computing / Computer programming

Hybrid Checkpointing for MPI Jobs in HPC Environments Chao Wang, Frank Mueller North Carolina State University

Add to Reading List

Source URL: www.christian-engelmann.info

Language: English - Date: 2011-05-03 17:32:33
136Application checkpointing / Scalability / High performance cloud computing / Computing / Exascale computing / Supercomputing

ENERGY LOGO DRAFT - BRITTANIC

Add to Reading List

Source URL: science.energy.gov

Language: English - Date: 2011-04-12 14:42:01
137Computer programming / Message Passing Interface / Thread / OpenMP / Application checkpointing / PM2 / Computer cluster / MPICH / Open MPI / Computing / Concurrent computing / Parallel computing

Supporting Adaptivity in MPI for Dynamic Parallel Applications ∗ Chao Huang, Gengbin Zheng, Laxmikant V. Kal´e Parallel Programming Laboratory University of Illinois at Urbana-Champaign {chuang10, gzheng, kale}@cs.uiu

Add to Reading List

Source URL: charm.cs.illinois.edu

Language: English - Date: 2011-05-01 22:22:41
138Computer architecture / Fault-tolerant computer systems / Concurrent computing / Charm++ / Parallel programming / Application checkpointing / Thread / Virtual Processor / Computer cluster / Computing / System software / Parallel computing

Adaptive MPI Chao Huang, Orion Lawlor, L. V. Kal´e [removed], [removed], [removed] Parallel Programming Laboratory University of Illinois at Urbana-Champaign

Add to Reading List

Source URL: charm.cs.illinois.edu

Language: English - Date: 2011-05-01 22:21:56
139Parallel computing / Data logger / Computing / Technology / Data management / Fault-tolerant computer systems / Application checkpointing / Extensible Storage Engine

Programming Support and Adaptive Checkpointing for High-throughput Data Services with Log-based Recovery Jingyu Zhou Computer Science Department Shanghai Jiao Tong University [removed]

Add to Reading List

Source URL: www.cs.ucsb.edu

Language: English - Date: 2010-06-22 17:03:29
140Linux / Monolithic kernels / Device drivers / Application checkpointing / Kernel / Ring / Device driver / Ioctl / Fault injection / Computing / Computer architecture / System software

Fine-Grained Fault Tolerance using Device Checkpoints Asim Kadav, Matthew J. Renzelmann, Michael M. Swift Computer Sciences Department, University of Wisconsin-Madison {kadav, mjr, swift} @cs.wisc.edu Abstract

Add to Reading List

Source URL: pages.cs.wisc.edu

Language: English - Date: 2013-02-11 11:55:04
UPDATE